Prielhofer et al. Microbial Cell Factories 2013, 12:5 
http://www.microbialcellfactories.eom/content/12/1/5 



RESEARCH Open Access 



Induction without methanol: novel regulated 
promoters enable high-level expression in Pichia 
pastoris 

Roland Prielhofer 1 , Michael Maurer 2,3 , Joachim Klein 4 , Jana Wenger 4 , Christoph Kiziak 4 , Brigitte Gasser 1,2 * 
and Diethard Mattanovich 1,2 




MICROBIAL CELL 
FACTORIES 



Abstract 

Background: Inducible high-level expression is favoured for recombinant protein production in Pichia pastoris. 
Therefore, novel regulated promoters are desired, ideally repressing heterologous gene expression during initial 
growth and enabling it in the production phase. In a typical large scale fed-batch culture repression is desired 
during the batch phase where cells grow on a surplus of e.g. glycerol, while heterologous gene expression should 
be active in the feed phase under carbon (e.g. glucose) limitation. 

Results: DNA microarray analysis of P. pastoris wild type cells growing in glycerol-based batch and glucose-based 
fed batch was used for the identification of genes with both, strong repression on glycerol and high-level 
expression in the feed phase. Six novel glucose-limit inducible promoters were successfully applied to express the 
intracellular reporter eGFP. The highest expression levels together with strong repression in pre-culture were 
achieved with the novel promoters P G1 and P G6 . 

Human serum albumin (HSA) was used to characterize the promoters with an industrially relevant secreted protein. 
A P G1 clone with two gene copies reached about 230% of the biomass specific HSA titer in glucose-based fed 
batch fermentation compared to a P G ap clone with identical gene copy number, while P G6 only achieved 39%. Two 
clones each carrying eleven gene copies, expressing HSA under control of P G1 and P G6 respectively were generated 
by post-transformational vector amplification. They produced about 1.0 and 0.7 g L" 1 HSA respectively in equal fed 
batch processes. The suitability in production processes was also verified with HyHEL antibody Fab fragment for P G1 
and with porcine carboxypeptidase B for P G6 . Moreover, the molecular function of the gene under the control of 
P G1 was determined to encode a high-affinity glucose transporter and named GTH1. 

Conclusions: A set of novel regulated promoters, enabling induction without methanol, was successfully identified 
by using DNA microarrays and shown to be suitable for high level expression of recombinant proteins in 
glucose-based protein production processes. 
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Background 

The methylotrophic yeast Pichia pastoris is widely used 
as a production platform for heterologous proteins. 
Latest developments in strain engineering for improved 
protein folding and secretion and glyco-engineering have 
recently been reviewed by Damasceno et al [1]. 

Another important target for strain development is 
the promoter driving expression of the heterologous gene. 
A summary of the most important promoters of non- 
methylotrophic and methylotrophic yeasts is provided 
by Mattanovich et al. [2]. While production of recombin- 
ant proteins in P. pastoris has been successfully achieved 
under control of the constitutive glyceraldehyde-3- 
phosphate dehydrogenase promoter (Pgap)> regulated 
promoters have several advantages: they enable initial 
biomass gain without product formation and allow tuning 
of the production process. Additionally, a potential impact 
of product accumulation on growth or viability of the cells 
can be prevented by decoupling growth from the 
production phase. 

However, todays available regulated promoters of 
P. pastoris have drawbacks. Many of them derive from 
methanol utilization pathway genes, which are generally 
repressed by glucose and/or ethanol and strongly induced 
by methanol. V AO xi induces high-level expression of its 
encoded alcohol oxidase 1, which catalyzes the oxidation 
of methanol to formaldehyde [3]. Its weaker homolog 
Vaox2 has been used for protein production as well [4]. 
Another strong promoter of this pathway is V FLDh its gene 
formaldehyde dehydrogenase is either induced by methyl- 
amine or methanol [5] . The promoter of dihydroxyacetone 
synthase, V DAS) was reported to be similarly regulated and 
even stronger than V AOX i [3], however it is not com- 
monly used for protein production. Methanol and 
methylamine are both highly flammable and hazardous 
to health, so safety precautions are required for their 
industrial use. In addition to that, methanol consump- 
tion is technically disadvantageous because it causes 
high heat evolution and an increased oxygen demand 
during the fed batch phase [6]. 

The P/czj promoter controls the expression of isocitrate 
lyase and is regulated by the carbon source used for cell 
growth. No detectable promoter activity is present when 
cells are growing on glucose, while it gets turned on when 
cells are stationary or growing on ethanol [7]. Hence, this 
promoter might be an alternative for some applications, 
but its regulatory properties are poor. PH089 is a regu- 
lated sodium phosphate symporter and its promoter was 
investigated and shown to produce reasonable amounts of 
protein [8]. Cells must be phosphate-limited for the full 
activation of V PH089) and an increase in product titer was 
even shown in phosphate-limited stationary phase. Add- 
itionally, an impact on growth was reported and reduced 
cellular fitness can be assumed in these conditions. 



On the other hand, constitutive promoters might be 
advantageous for the over-expression of genes or to 
co-express helper factors and marker genes. The 
widely-used Pgap controls the expression of glyceral- 
dehyde-3-phosphate dehydrogenase at a high basal 
level [9]. Its productivity can be influenced by control- 
ling the growth rate at the optimal activity of Pgap 
[10], and by a decrease of available 0 2 levels [11]. The 
promoter of the translational elongation factor EF-1 
alpha gene, V TEF1) was analyzed and showed a tighter 
growth-associated regulation than Pgap [12]. 

A promoter library of V AOX i was generated, leading to 
a few variants that were slightly stronger than wild type 
¥aoxi> and a number of variants with altered regulatory 
properties, some of them being active without methanol 
[13]. Another library approach was done for Pgap by 
mutation and clones expressing yeast-enhanced green 
fluorescent protein (yEGFP) under the control of 
obtained variants produced 8 to 218% of fluorescence in- 
tensity compared to the wild type promoter [14]. 
Potential promoter libraries can also be deduced from 
microarray data and rational considerations. Focussing 
on highly transcribed genes in general, 15 promoters 
were selected for characterization and the promoter of 
the thiamine biosynthesis gene V THI11) which is regulated 
by the availability of thiamine in the growth medium, 
was discovered [15]. 

As described above, the number of strong promoters 
with advantageous properties for protein production is 
limited in P. pastoris. This work was designated to 
identify novel promoters with both, high expression and 
an optimal regulation in production process conditions. 
Equally important, the addition of inducers was to be 
avoided, because their use is often associated with extra 
costs and safety precautions in large scale fermentation 
processes. 

A typical production process under the control of 
Pgap uses glycerol in the batch phase, and a constant 
glucose fed batch for 100 hours to reach more than 100 
g L' 1 cell dry weight [16]. In order to identify potential 
inducible promoters in the course of this process, we 
used DNA microarray analysis to compare gene expres- 
sion patterns of glycerol-excess (=batch growth phase) 
and glucose-limited (=fed batch production phase) con- 
ditions. The expression capacity of selected promoter 
targets was characterized with model proteins and veri- 
fied in fed batch processes. 

Results and discussion 

Identification of novel promoters with desired induction 
properties 

A typical P. pastoris protein production process avoiding 
methanol induction starts with a glycerol batch (surplus 
of carbon source) which is followed by a glucose fed 
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batch (limit of carbon source) [10]. DNA microarrays 
were used to analyze gene expression patterns and to 
identify potential promoters for this cultivation strategy. 
In order to eliminate growth rate related effects, glucose- 
limited conditions were analyzed in chemostat cultivation 
where the growth rate, similar to that in the batch phase, 
was fixed by controlling the dilution rate at 0.1 h" 1 . 

The microarray data was mined for genes with both, 
high difference in expression level between repressed 
and induced state (fold change) as well as high signal in- 
tensity in the induced state to identify potent promoters 
for inducible high-level protein production in P. pastoris. 
Six potential promoters (abbreviated as P G1 , P G 3, Pg4> 
Pg6> Pg7 and P G8 , see Figure 1 and Table 1) were consid- 
ered for further characterization. 

Verification of promoter strength and regulation 

At first, the strength and regulation of the novel promo- 
ters were assayed with the intracellular reporter protein 
eGFP in small scale screening cultures. Both, repressive 
conditions in pre-culture (glycerol excess) and induced 
ones during main culture (glucose limit) were analyzed 
during the screening. In order to simulate fed batch like 
conditions in screenings, we had to adapt the screening 
strategy. Instead of usual feedings with certain amounts 
of glucose which lead to repeated batch phases, we used 
slow glucose releasing polymer particles (12mm feed 
beads, Kuhner, CH), liberating glucose at a non-linear 
rate of 1.63 • t 0 ' 74 mg per disc (t = time [h]), which 
equals to 28.6 mg per disc after 48 hours. 

As shown in Figure 2, P G1 and P G 6 had superior prop- 
erties in terms of both, regulation and induction 
strength. In order to visualize gene dosage effects, gen- 
omic DNA of several clones was isolated and analyzed 
by real-time PCR to determine the gene copy number 
(GCN) of eGFP. Compared to a Pgap clone with one 
gene copy, the specific fluorescence of P G i and P G6 
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Figure 1 Microarray data (red channel) of identified target 
genes in comparison to GAP. Bars represent relative expression 
levels in glycerol excess (batch phase, blue bars on the left side) and 
in glucose limit (chemostat cultivation, red bars on the right side). 
Numbers in the right column represent the fold change of signal 
intensity between glucose limit and glycerol excess conditions. 
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Table 1 Identified promoter candidates 



r rurnuici 


Gene 


r. fJUbLUrib ycllc lUcrlllll Icfl ^bllcilll VJO 1 ID) 


Pgap 


GAP 


PAS_chr2-1_0437 


Pgi 


G1 


PAS_chr1-3_0011 


P G 3 


G3 


PAS_chr4_0550 


Pg4 


G4 


PAS_chr4_0043 


Pg6 


G6 


PAS_chr2-1_0853 


PG7 


G7 


PAS_chr1-4_0570 


Pg8 


G8 


PAS_chr1-3_0165 



controlled expression of eGFP (normalized to GCN) 
were induced from almost zero in batch phase to about 
150% and 100% after 48 h screening culture, respect- 
ively. The other promoters P G3 , P G 4> Pg7> and P G 8 still 
showed a good regulation and induction strength suit- 
able for inducible protein expression, with expression 
strengths spanning a spectrum of about 20% to 120% 
relative to Pgap (Figure 2A). The next step was to inves- 
tigate the induction behaviour of the novel promoters in 
more detail. 

Analysis of the glucose dependent regulation 

The induction behaviour of the novel promoters was 
characterized in screenings with eGFP producing clones 
in YP media containing different amounts of glucose 
(ranging from 20 to 0.002 g I/ 1 ). The cells were culti- 
vated for 5-6 hours and eGFP expression was analyzed 
by flow cytometry. 

Promoters P G i and P G7 showed a flat induction course 
leading to full activity only with less than 0.05 gL" 1 glu- 
cose. That is clearly different to P G3 , P G4 and P G6 's stee- 
per regulation pattern which reach their top activity 
already at around 4 g L" 1 glucose (Figure 3). In other 
words, P G1 is not only the strongest but also most tightly 
repressed by glucose among the promoters tested here. 

Based on these regulatory features we intended to 
characterize the functions of the genes under control 
of the P G promoters. At the time of their identifica- 
tion, no or only putative functions were assigned to 
the underlying genes aside from Gl. Therefore, we 
used NCBI Conserved Domain search to analyze the 
protein sequences in order to identify putative gene 
functions. 

The gene under the control of P G1 was previously 
functionally clustered with K. lactis high-affinity glucose 
transporter HGT1 [17]. It contains two major facilitator 
superfamily domains, same as the G7 gene, which is 
therefore assumed to be a glucose transporter too. For 
Saccharomyces cerevisiae, it was reported that hexose 
transporters underlie complex regulation patterns and 
are expressed in dependence of glucose concentration 
[18]. The regulation pattern exhibited by the promoters 
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Figure 2 Expression of eGFP under control of the novel 
promoters P G i, P G 3/ Pg4/ Pgs/ Pg? and P G8 . (A) Specific eGFP 
fluorescence in shake flask screenings related to P G ap and to eGFP 
gene copy number. (B) Fed batch cultivations of single gene copy 
clones expressing eGFP under the control of P G ap and P G1 . Relative 
eGFP expression (solid lines) and OD 600 (dashed lines) are shown 
over the feed time. 



P G3 , Pg4> Pg6 an d Pgs might be associated with a role in 
central metabolism. An AKR (aldo keto reductase) do- 
main was found in the gene expressed under P G3 . The 
genes controlled by P G4 and P G6 are both putative 
aldehyde dehydrogenases, predicted to be localized in 
the cytosol and in the mitochondria, respectively. The 
Gtil/Pac2 family domain found in G8 plays a role in 
gluconate uptake upon glucose starvation and in sexual 
development in Schizosaccharomyces pombe. 

We could show the need of an explicit glucose limit 
for full activity of the novel promoters, which is most 
pronounced for P G1 . This demonstrates that the micro- 
array data-based promoter selection is excellently suited 
to select for promoters with features relevant for biopro- 
cesses, and secondly indicates the novel promoter's 
advantages in fed batch fermentation. 

To prove this statement, the application of the strongest 
and most promising promoter P G1 was tested in a fed 
batch fermentation where truly glucose-limited conditions 
are present [10]. A single gene copy clone expressing 
eGFP was chosen for comparison to an equivalent single 
gene copy clone of eGFP under the control of P G ap- 



Thereby we could show that the P G1 promoter remains 
repressed during the batch phase and that its induction 
during fed batch clearly exceeds the strength of P G ap- 
Relative eGFP expression (fluorescence related to the 
culture volume) and OD 600 over the feed time are shown 
in Figure 2B. 



Expression of secreted human serum albumin 

P G i and P G6 were further selected to assay the produc- 
tion of a secreted protein under their control. Human 
serum albumin (HSA) is efficiently produced in P. pas- 
toris. Therefore it can be regarded as an industrially 
relevant secreted reporter protein. Its expression under 
the control of P G1 , P G6 and P G ap was screened in shake 
flasks with glucose-limited conditions (through the use 
of feed beads) in the main culture. During the glycerol 
(batch) pre-culture, both P G1 and P G6 promoters remained 
well repressed. As seen before, during main culture expres- 
sion under P G1 was stronger than under P G6 and, in rela- 
tion to gene copy number, reached around 77% of the 
biomass specific HSA yield of cells expressing under P G ad 
while P G6 produced about 22% of P GA p (Figure 4A). How- 
ever, the novel promoters might not show their full poten- 
tial in shake flask screenings, since these conditions are not 
strictly glucose limited during the entire production phase. 

To exploit the full potential of the novel promoters, glu- 
cose limited fed batch cultivations were performed. Based 
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Figure 3 Induction behaviour of the novel promoters. Specific 
eGFP fluorescence of clones expressing eGFP under the control of 
Pd/ p G3, Pg4, Pee and P G7 in media containing different amounts of 
glucose. Data is related to P G ap/ normalized to 1.0 at the highest 
glucose concentration of 20 g L" 1 and plotted against the 
logarithmic glucose concentration (trend line calculation: four 
parameter logistic curve). The glucose concentration given on the x- 
axis refers to the glucose set point at the beginning of the 
cultivation (serial dilutions ranging from 20 to 0.002 g L" 1 ). This 
screening setup and data processing points out relative promoter 
activities, thereby showing the kinetics of induction, but does not 
allow comparison of promoter strength. 
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Figure 4 Expression of secreted HSA using the novel promoters P G i and P G6 in shake flask and fed batch cultivations. (A) HSA 

expression in shake flask screenings related to P G ap and to the gene copy number. (B) Dry cell weight and (C) HSA titer in fed batch cultivations 
of double and single gene copy clones expressing under the control of P G1 (circle, two copies), P G6 (diamond, one copy) and P GA p (black square, 
two copies and black-and-white, one copy). (D) Detail of (C) showing late batch and early fed batch phase, highlighting the different regulation 
properties of the promoters. Except for the single gene copy P GA p clone, all fermentations were performed in duplicates. 



on the screening results, one clone each expressing suffi- 
cient amounts of HSA under the control of P G i and P G 6 
was selected. Those clones, harbouring two and one gene 
copies respectively, were compared with their respective 
gene copy equivalent P G ap clones (Figure 4B, 4C and 4D). 
Dry cell weight (DCW) and HSA titers are summarized in 
Table 2. HSA titers of P GAP clones correlate with their re- 
spective gene copy number. Again, the P G1 clone showed 
superior properties - it clearly outperformed the P GAP clone 
with the same gene copy number and produced about 
230% of the biomass specific product yield compared to 
Pgap- Pg6 produced about 39% of the biomass specific 
HSA yield of its gene copy equivalent P GAP clone. Besides, 



PGAP-driven expression was already active in the batch 
phase, and more than 5% of the final HSA amount was 
already present at the batch end for both clones expressing 
under its control. While this is not an issue in case of HSA, 
it is a clear disadvantage compared to inducible promoters 
such as P G1 in case of toxic or difficult to express products. 
Figure 4C shows HSA titer over the feed time, and the 
unique repression/induction efficiency of P G1 is clearly 
pointed out in the first hours (Figure 4D). Both, P G1 and 
P G6 showed good repression in the batch phase and induc- 
tion by the glucose limited feed. 

To verify that the novel promoters also exhibit their 
superior regulatory properties and expression capacity 



Table 2 Summary of fed batch cultivations of P. pastoris expressing HSA under the control of P G i, Pee and Pgap 



Batch end Fed batch end 



Promoter 


GCN 


DCW 


HSA 


HSA/DCW 


DCW 


HSA 


HSA/DCW 


% HSA/DCW 






[g L" 1 ] 


[mg L" 1 ] 


[mg g" 1 ] 


[g L" 1 ] 


[mg L 1 ] 


[mg g" 1 ] 


of Pgap 


Pgt 


2 


24.3 


0.5 


0.0 


126.9 


303.1 


2.4 


231.2 


PG6* 


1 


23.9 


0.3 


0.0 


127.1 


24.3 


0.2 


38.9 


Pgap* 


2 


23.8 


10.1 


0.4 


123.7 


128.2 


1.0 




Pgap 


1 


24.2 


5.0 


0.2 


117.7 


57.8 


0.5 





* performed in duplicates. 
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Table 3 Summary of fed batch cultivations of GCN amplified HSA expressing clones under the control of P G i and P G e 









Batch end 






Fed batch end 




Promoter 


GCN 


DCW 


HSA 


HSA/DCW 


DCW 


HSA 


HSA/DCW 






[g L" 1 ] 


[mg L 1 ] 


[mg g" 1 ] 


[g L" 1 ] 


[mg L 1 ] 


[mg g" 1 ] 


Pgi 


11 


18.9 


0.2 


0.0 


114.0 


1060.8 


9.3 


Pg6* 


11 


22.5 


0.3 


0.0 


110.8 


728.7 


6.6 



* performed in duplicates. 



in industrially relevant conditions, we elevated HSA 
gene copy number by post-transformational vector 
amplification as described previously [19]. Thereby, 
we were able to produce more than 1 g L" 1 HSA 
under the control of P G i with a clone harbouring 11 
gene copies, which corresponds to the 3.4-fold titer 
of its two copy clone (Table 3). Again, P G1 outper- 
formed a comparable clone with the same gene copy 
number under the control of the Pgap promoter, 
which produced 607 mg L" 1 HSA in a similar fermen- 
tation [19]. High level HSA production was also 
achieved with an amplified clone expressing HSA (11 
gene copies as well) under the control of the weaker 
P G 6 promoter, which produced more than 720 mg L" 1 
HSA (Table 3). This titer is approximately 30-fold 
higher than the titer reached with the P G6 single copy 
clone (24 mg I/ 1 ), thus indicating that multiple copies 
of expression cassettes under control of a weaker pro- 
moter can also lead to high productivities. One 
possible explanation of this effect could be that the ratio 
of a repressing protein to promoter copy number and 
thus repressor binding sites is decreased in the ampli- 
fied clones, therefore leading to higher transcription. 

Additionally, the porcine enzyme carboxypeptidase 
B (CpB) that is used for human insulin production, and an 
antibody Fab fragment were produced under the 
control of P G1 and P G6 respectively, and exceeded the 
production levels compared to Pgap- Thereby we verified 
again the suitability of these promoters in standard 
glucose based production processes. 

P G1 activity depends on specific growth rate 

We elucidated the expression activity of P G1 at different 
growth rates using an HSA clone with two gene copies 
under its control. It was cultivated in chemostat with 
different dilution rates and the highest specific product 
formation was found at a growth rate of about 0.07 h" 1 
(Figure 5). This clearly differs to the profile obtained 
with P G ap in [10], where the highest specific product 
formation was obtained only at higher growth rates. 
Growth rate dependency may be utilized to optimize 
space-time yield or other parameters in the production 
processes [10]. 



Knock out of G1 

Furthermore, we decided to clarify the function of the gene 
PAS_chrl-3_0011, which underlies the control of the 
promoter P G1 . It contains 12 transmembrane domains 
(predicted by TMHMM Server v. 2.0), two Major Facilita- 
tor Superfamily (MFS) and other transporter domains. 
Based on the sequence homology to the K. lactis high- 
affinity glucose transporter HGT1, the gene controlled by 
P G1 was expected to have a function in glucose transport 
[17], [20]. Strong activity of its promoter at very low glucose 
concentrations further strengthened this assumption. For 
further verification, the gene was disrupted using the split 
marker cassette technique (primers given in Additional file 
1: Table SI) as described by Heiss et al. [21]. Similar as 
described by Jorgensen and his colleagues [22], we com- 
pared the glucose uptake of the wild type and a Gl knock 
out clone in glucose-limited chemostat cultivations at dif- 
ferent growth rates. The glucose saturation constants were 
calculated from the residual glucose concentrations 
(Table 4) and a K s of 9.7, 23.1 and 69.3 uM was obtained 
for three different dilution rates (|i=0.14, 0.1 and 0.05 h" 1 ) 
for the wild type. Changing K s values are observed for the 
whole cell in different conditions, which is due to the differ- 
ential regulation of its several transporters. A reduced 
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Figure 5 Correlation of specific productivity to specific growth 
rate using P G1 . Specific product formation rate (q p ) observed in 
chemostat cultivation at different dilution rates of a clone expressing 
HSA under the control of P G1 as well as the respective trend curve 
(spline curve). 
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Table 4 Glucose substrate saturation constants of a 



wildtype and a G1 knock out clone 





G1 k. o., 


Umax = 0.18 h _1 


Wildtype, Umax = 


0.18 h' 1 


D [h" 1 ] 


S[uM] 


X [g L" 1 ] 


K s [MM] 


S[uM] 


X [g L" 1 ] 


K s [MM] 


0.14 


316.4 


29.5 


90.4 


33.9 


30.5 


9.7 


0.10 


123.8 


31.3 


99.0 


28.9 


32.1 


23.1 


0.05 


79.9 


31.3 


207.8 


26.6 


30.8 


69.3 



Residual glucose (S), dry cell weight (X) and substrate saturation constant (K s ) 
of a wild type and a G1 knock out (k. o.) clone at different dilution rates in 
chemostat cultivations. 



capacity of glucose uptake at low specific growth rates has 
been reported before [22]. The Gl knock out clone 
appeared to have much higher saturation constants of 90.4, 
99.0 and 207.8 \iM at the same dilution rates, which was 
also described for the high-affinity glucose transporter dis- 
ruption in Aspergillus niger [22]. The knock out clone does 
not display the low K s values of the wild type, so that the 
gene PAS_chrl-3_0011 was clearly identified as a high- 
affinity glucose transporter. As the short name HGT1 is 
used as an alias for a peptide transporter in S. cerevisiae we 
suggest the short name GTH1 (glucose transporter with 
high affinity) for this P. pastoris gene. 

Conclusions 

Efficient regulated promoters cannot necessarily be 
found by classical batch screening approaches. Simulat- 
ing production conditions in lab scale and searching the 
promoter space offers a new target oriented approach. 
We could show here that the cultivation of P. pastoris in 
conditions where repression or induction are desired, 
followed by the analysis of transcript levels with DNA 
microarrays offers a potent opportunity to find new, 
strong and regulated promoters. 

Six novel promoters were identified and further charac- 
terized. All of them are activated by carbon source deple- 
tion. The new promoters provide a tool box for 
expression of recombinant genes and are thus well suit- 
able for protein production processes. P G i had the most 
favourable repression kinetics and exceeded the expres- 
sion levels of the well-established constitutive GAP pro- 
moter in glucose limited fed batch cultures by more than 
twofold. The molecular function of the gene under its 
control was identified as high-affinity glucose transporter 
and named GTH1. 



Materials and methods 

Strains and cultivation 

Escherichia coli DH10B (Invitrogen) was used for 
subcloning. It was routinely cultivated in petri dishes or 
shake flasks using LB media supplemented with 25 ug 
ml/ 1 Zeocin. A wild type Pichia pastoris strain 



CBS2612 which can grow on minimal media supple- 
mented with biotin, was used for protein expression in 
this work. 

The main culture for screenings was either done with YP 
or BM media and glucose feed beads (12 mm, Kuhner, CH) 
which provided the carbon source. 

YP media contained 20 g L" 1 peptone and 10 g L" 1 
yeast extract, which can be supplemented with 12.6 g 
glycerol or 20 g glucose to obtain YPG and YPD, 
respectively. For cultivation on plates, 5 g L" 1 agar-agar 
was added to the liquid medium. BM media was based 
on YP, supplemented with 13.4 g L" 1 yeast nitrogen base 
(Cat.No. 291940, Becton Dickinson, FR) with ammo- 
nium sulfate, 0.4 mg L" 1 biotin and 100 mM potassium 
phosphate buffer pH 6.0. 

Identification of novel inducible promoters 

a) Bioreactor cultivations 

Fermentations for the identification of promoter 
candidates were done in 3.5 L working volume 
bioreactors (Minifors, Infers, CH) in three biological 
replicates. Cells were grown for about 24 h in batch on 
glycerol medium, followed by an exponential feed phase 
on glycerol fed batch medium calculated as described by 
Resina et al. [23] with a specific growth rate of [i= 0.1 h" 1 
and a substrate yield coefficient of Y X /s of 0.5 g g" 1 . 
Sequentially, chemostat cultivation (D = u = 0.1 h" 1 ) 
with high density glucose medium was performed. 
Glycerol batch medium contained per liter: 2 g citric 
acid monohydrate, 39.2 g glycerol, 20.8 g NH 4 H 2 P0 4 , 
0.5 g MgS0 4 - 7H 2 0, 1.6 g KC1, 0.022 g CaCl 2 - 2H 2 0, 
0.8 mg biotin and 4.6 mL PTM1 trace salts stock 
solution. HC1 was added to set the pH to 5.0. 
Glycerol fed-batch medium contained per liter: 632 g 
glycerol, 8 g MgS0 4 - 7H 2 0, 22 g KC1, and 0.058 g 
CaCl 2 . 2H 2 0. 

High-density chemostat medium contained per liter: 2 g 
citric acid monohydrate, 99.42 g glucose monohydrate, 
22 g NH 4 H 2 P0 4 , 1.3 g MgS0 4 - 7H 2 0, 3.4 g KC1, 0.02 g 
CaCl 2 . 2H 2 0, 0.4 mg biotin and 3.2 mL PTM1 trace 
salts stock solution. HC1 was added to set the pH to 5.0. 
PTMi trace salts stock solution contained per liter: 
6.0 g CuS0 4 . 5H 2 0, 0.08 g Nal, 3.36 g MnS0 4 - 
H 2 0, 0.2 g Na 2 Mo0 4 . 2H 2 0, 0.02 g H 3 B0 3 , 0.82 g 
CoCl 2 , 20.0 g ZnCl 2 , 65.0 g FeS0 4 - 7H 2 0, 0.2 g 
biotin and 5.0 mL H 2 S0 4 (9596-98%). 

b) Microarray hybridization 

RNA purification and sample preparation as well as 
microarray hybridization (in-house designed P. pastoris 
specific oligonucleotide arrays, AMAD-ID: 034821, 
8x1 5K custom arrays, Agilent) and data analysis were 
done as described by Graf et al. [24]. 
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Characterization of promoter strength and regulation 

a) Cloning 

Cloning and transformation was done using the in- 
house vector pPuzzle [15], which contains a Zeocin 
resistance cassette for selection in both E. coli and 
yeast, an expression cassette for the gene of interest 
(GOI) consisting of a multiple cloning site and the 
S. cerevisiae CYC1 transcription terminator, and a 
locus for integration into the P. pastoris genome 
(3 ' AOX1 region or rDNA locus). Promoter 
sequences (up to 1000 bps upstream of the start 
codon of their respective genes) were PCR-amplified 
from P. pastoris genomic DNA (primer sequences 
see Additional file 1: Table SI). The promoters were 
ligated into pPuzzle in front of the start codons of the 
model proteins, using the Apal and the Sbfl 
restriction sites of the multiple cloning site of the 
vector. Vectors expressing the respective model protein 
under control of P G ap were used as controls 
throughout the study. For the expression of 
heterodimeric HyHEL antibody Fab fragment (HyHEL 
Fab), the expression cassettes of light chain and Fab 
heavy chain (each under control of P G1 ) were combined 
into one vector (using the strategy described in [27]). 
HSA was secreted by its native secretion leader, 
while for CpB and HyHEL Fab the S. cerevisiae alpha 
mating factor signal sequence was used. To avoid 
positional effects on reporter gene expression levels, 
genome integration of the expression plasmids was 
targeted to either the 3 'flanking region of the AOX1 
gene or the ribosomal DNA locus (rDNA, for 
multicopy integration) of P. pastoris, respectively. 
Plasmids were linearized within the genome integration 
region prior to electroporation (2 kV, 4 ms, GenePulser, 
BioRad) into electrocompetent P. pastoris. Multicopy 
integration of HSA expressing clones was done as 
described by Marx et al. [19] and selected at higher 
Zeocin concentrations (up to 1000 ug mL" 1 ). 
P. pastoris cells were first selected and cultivated in 
petri dishes on YPD agar and then inoculated in an 
YPG medium as pre-culture for screenings and 
fermentations. Antibiotic selection by Zeocin was 
applied on plates and in pre-culture at a concentration 
of 25 |ig mL" 1 or higher. 

b) Expression screening 

Expression of intracellular eGFP and the secreted 
proteins HSA, CpB and HyHEL Fab with the novel 
promoters in comparison to P G ap was evaluated in shake 
flask screenings. All screenings were performed at 25°C 
and with shaking at 180 rpm. Single colonies were 
inoculated in YPG medium with selection pressure 
(Zeocin) for pre-culture. After approximately 24 hours, 



the pre-culture was used to inoculate the main culture 
with an optical density (OD 60 o) of 0.1 (for eGFP) or 1 
(for HSA, CpB and HyHEL Fab) in 10 mL YP or BM 
medium, respectively. Glucose feed beads (12 mm, 
Kuhner, CH) were used to generate glucose-limiting 
growth conditions. Expression of eGFP was measured at 
the end of pre-culture and at 24 and 48 hours of the 
main culture. Culture supernatant of clones expressing 
secreted protein was harvested from the pre-culture and 
after 48 hours and cell density was determined by 
measuring wet cell weight or OD 60 o 

c) Comparative analysis of P. pastoris promoter activity 
In order to analyze relative transcription strength of 
the P G promoters at different glucose concentrations, a 
comparative promoter activity study using various 
glucose concentrations (ranging from 20 to 0.002 g L" 1 
glucose) was performed with eGFP expressing clones 
in 24-well plates (Cat. No. 7701-5110, Whatman, UK) 
covered with breath seal membranes (Cat. No. B-100, 
Excel Scientific, CA). Glucose concentrations of 20, 10, 
5, 2.5, 1.25, 0.63, 0.31, 0.16, 0.08, 0.04, 0.02, 0.01, 0.005, 
0.002 g L" 1 were obtained by serial dilution in YP 
media, and represent the inital setpoints. The main 
culture was inoculated from YPG-Zeocin pre-culture 
with an OD 60 o of 0.01 and samples were taken after 
5-6 hours and analyzed by flow cytometry. 

d) Fed batch cultivation 

All fed batch fermentations were done in 1.0 L working 
volume bioreactors (SR0700ODLS, DASGIP, DE). The 
dissolved oxygen was controlled at DO = 20% with the 
stirrer speed (400 - 1200 rpm). Aeration rate was 18 L 
h" 1 air, the temperature was controlled at 25°C and the 
pH was controlled at 5.85 for HSA [25] or pH 5.0 for 
the other proteins [10] with addition of ammonium 
hydroxide (25%). To start the fermentation, 300 mL 
batch medium was sterile filtered into the fermenter 
and a P. pastoris clone was inoculated from an 
overnight pre-culture with a starting optical density 
(OD 60 o) of 1. For the cultivation of clones expressing 
eGFP, the batch phase of approximately 25 h was 
followed by a fed batch phase with a feeding rate 
optimized according to [10]. HSA expressing strains 
were cultivated as described by Marx et al. [19], where 
the batch phase was followed by a constant feed of 2 g 
h" 1 fed batch medium for 100 h, Carboxypeptidase B 
and HyHEL Fab expressing clones were cultivated 
similarly. Samples were taken during batch and fed 
batch phase, and analyzed for expression. 
Glycerol batch and glucose fed batch media for eGFP, 
HyHEL Fab and Carboxypeptidase B expressing 
clones were exactly as described in [10], while for the 
production of HSA the media was described in [19]. 
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e) Chemostat cultivation 

A strain expressing HSA (2 GCN) under control of 
P G1 was tested for its growth rate dependent 
expression behaviour in chemostat at different dilution 
rates (D = 0.03, 0.06, 0.10, 0.13) [10]. 
For characterization of glucose uptake characteristics, 
the P. pastoris wild type strain and the strain deleted 
for PAS_chrl-3_0011 were cultivated in glucose 
limited chemostats at D = 0.05, 0.10, and 0.14 h~\ 
Samples were taken rapidly as described below. 

Analytical methods 

a) Copy number determination with real-time PCR 
Genomic DNA was isolated using the DNeasy 
Blood&Tissue Kit (Cat. No. 69504, Quiagen, DE). 
Gene copy numbers were determined with quantitative 
PCR using the SensiMix SYBR Kit (QT605-05, Bioline 
reagents, UK). The primers (supplementary Additional 
file 1: Table SI) and sample were mixed with the 
SensiMix and applied for real time analysis in a real- 
time PCR cycler (Rotor Gene, Qiagen, DE). All 
samples were analyzed in tri- or quadruplicates. Data 
analysis was performed with the two standard curve 
method of the Rotor Gene software. The actin gene 
ACT1 was used as calibrator. 

b) Determination of protein expression levels 

A plate reader (Infinite 200, Tecan, CH) was used to 
determine eGFP fluorescence in fermentation 
samples. Therefore, samples were diluted to an OD 600 
of 5 and fluorescence intensity was then related to 
the culture volume. 

Expression of eGFP in screenings was analyzed by 
flow cytometry as described before [15]. Specific 
eGFP fluorescence referred to in this study is the 
fluorescence intensity related to the cell volume for 
each data point as described by Hohenblum et al. 
[26]. Then the geometric mean of the population's 
specific fluorescence was normalized by subtracting 
background signal (of non-producing P. pastoris wild 
type cells) and related to expression under the 
control of Pqap- 

For quantification of HSA in shake flask and 
fermentation supernatants, the Human Albumin 
ELISA Quantitation Set (Cat. No. E80-129, Bethyl 
Laboratories, TX) was used. The HSA standard was 
applied with a starting concentration of 400 ng mL" 1 . 
Dilution-, Blocking- and Washing buffer were based on 
TBS (50 mM Tris-HCl, 140 mM NaCl, pH 8.0) and 
completed with BSA (1% (w/v)) and/or Tween20 
(0.05% (v/v)) accordingly. 

HyHEL Fab was determined with ELISA as described 
previously [27]. 



CpB was quantified using an enzymatic assay based 
on the cleavage of hippuryl-L-arginine (Cat. No. 
H2508, Sigma, MO). Generation of hippuric acid was 
monitored at its absorbance maximum of 254 nm. 
Prior to the measurement, the samples were desalted 
with Zeba Spin columns (Thermo Fisher Scientific, 
IL) and activated with trypsin (Cat. No. T8345, 
Sigma, MO). 

c) Determination of residual glucose 

The D-Glucose Assay - GOPOD-Format (Megazymes, 
IE) was used to determine residual glucose of chemostat 
samples. Supernatant sampling was done by pumping 
culture broth out of the bioreactor by producing an 
overpressure, and its direct sterile filtration using a 
vacuum filter unit (Cat. No. 5141178, Whatman, UK). 
Glucose-limited cultivations usually go along with very 
low residual glucose concentrations in the supernatant, 
so the manufacturer s protocol was adapted for glucose 
concentrations from 10 to 100 mg L" 1 . Briefly, the 
ratio of reaction buffer to sample was changed from 
30:1 to 3:1. 

Additional file 



Additional file 1: Table SI. Primer Sequences. Sequences of 
oligonucleotides used for amplification of promoters, determination of 
gene copy numbers of the model protein expression cassettes, and 
generation of G1 disruption cassette (including verification of positive 
knock-outs). 
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